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Abstract — This paper studies the successive convex approxima- 
tion for low complexity (SCALE) algorithm, which was proposed 
to address the weighted sum rate (WSR) maximized dynamic 
power spectrum management (DSM) problem for multiuser 
multicarrier systems. To this end, we first revisit the algorithm, 
and then present geometric interpretation and properties of 
the algorithm. A geometric programming (GP) implementation 
approach is proposed and compared with the low-complexity ap- 
proach proposed previously. In particular, an analytical method is 
proposed to set up the default lower-bound constraints added by 
a GP solver. Finally, numerical experiments are used to illustrate 
the analysis and compare the two implementation approaches. 

Index Terms — Dynamic spectrum management, power control, 
cochannel interference mitigation, convex optimization, geometric 
programming, orthogonal frequency division modulation, digital 
subscriber lines. 

I. Introduction 

Weighted sum rate (WSR) maximized dynamic power spec- 
trum management (DSM) has lately been attracting much 
research interest for multiuser multicarrier systems. Optimum 
spectrum balancing (OSB) algorithm was first proposed based 
on dual decomposition f\\. When there is a big number of 
carriers, the global optimality of this algorithm was justified 
in 0, ||3|, by showing that the duality gap of the problem 
approaches zero asymptotically as the number of carriers goes 
to infinity. A more efficient algorithm, referred to as iterative 
spectrum balancing (ISB), was proposed in ||2l, H. 

Recently, a successive convex approximation for low com- 
plexity (SCALE) algorithm was proposed based on the idea 
of solving convex approximations of the original problem 
successively for increasingly better solutions ID, IS). This 
algorithm also proved to be very useful to address various 
resource allocation problems for interference mitigation Q- 
Compared with the above existing works, this paper makes 
the following contributions: 

• Novel geometric interpretation and properties are pre- 
sented for the SCALE algorithm. Most interestingly, we 
show that the algorithm is asymptotically optimum, i.e., it 
produces power allocations with WSRs approaching the 
maximum value as long as a sufficiently good initializa- 
tion is used, even though the produced power allocations 
are entrywise positive while the optimum one might 
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contain zero entries. This property is also illustrated by 
numerical experiments. 
• A geometric programming (GP) approach is developed to 
implement the SCALE algorithm. This approach reveals 
the fact that, each convex approximate problem for the 
SCALE algorithm is actually a GP, thus a GP solver can 
be exploited to pursue its global optimum. A subtlety is 
that default lower-bound constraints are added in the GP 
solver to avoid overflow (see Appendix). For GP based 
power allocation algorithms reported in the literature, the 
incurred loss of optimum objective value as well as how 
to set up these constraints were however not discussed 
m, IS). In view of this context, these aspects are studied 
for solving the DSM problem with the GP implemen- 
tation of the SCALE algorithm. The GP approach is 
also compared with the low-complexity implementation 
approach previously proposed in jS]. 

The rest of this paper is organized as follows. First, the 
system model and DSM problem are described in Section II. In 
Section III, the SCALE algorithm is revisited. Then, geometric 
interpretation and properties are presented in Section IV. 
After that, the two implementation approaches are shown in 
Section V. In Section VI, numerical experiments are given. 
Finally, some conclusions and future research directions are 
summarized in Section VII. 

Notations: A vector is denoted by a lower-case bold letter, 
e.g., X, with its i-th entry denoted by [x]i. A matrix is 
denoted by an upper-case bold letter, e.g., X, with [X]y 
denoting the entry at the i-th row and j-th column. Xi >~ X2 
(respectively, Xi ^ X2) means that Xi is entrywise strictly 
greater (respectively, greater) than X2. and e-'^ represents 
the vector and matrix which are entrywise mapped from x and 
X through the exponential function, respectively. Vxy(X) 
stands for a matrix containing entrywise derivative of y(X) 
with respect to X. 

II. System model and DSM problem 

Consider the scenario where K transmission links, each 
using carriers, communicate simultaneously with cochannel 
interference. Each of the transmitters and receivers is equipped 
with a single antenna. Transmitter k {k = 1, - ■ ■ , K) encodes 
its data and then emits them over all carriers to receiver fc, with 
Pkn being the transmit power for carrier n (fc = 1, • • ■ ,K, 
n = 1, • • • 1^)- The channel power gain at carrier n from 
transmitter / to receiver k is denoted by Gkin- We assume 
Gkin > Q,y l,k,n.lt is assumed that each receiver decodes its 
own data by treating interference as noise, and every coherence 
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period in which all channels remain unchanged is sufficiently 
long. There exits a spectrum management center (SMC) which 
first obtains {Gkin\^k,l,n}, then executes a DSM algorithm, 
and finally assigns the optimized power spectra to transmitters 
for data transmission. 

To facilitate analysis, we stack all power variables into the 
matrix P with [P]fe„ = pfc„ and its ?7-th column is denoted 
by p„. The signal-to-interference-plus-noise ratio (SINR) at 
carrier n of receiver k can be expressed as 7fc„(p„) = j^f'j^^, 

where /fc„(p„) = + Y^f^ij^k^kinPin represents the 
interference-plus-noise power received at carrier n for receiver 
k, and af,,^ is the power of additive white Gaussian noise 
(AWGN). Gn is a matrix with all diagonal entries equal to 
zero and [G„]fe; — if k ^ I. We make a mild assumption 
that V n, Gn is primitive, i.e., there exists a positive integer i 
such that (G„)VO (see Theorem 8.5.2 of ifTOl ). 
The WSR maximized DSM problem is 

K N , s 

max 5(P) = E E log (1 + ^^^) 



N 



where F > 1, Wk, p^, and represent the SINR gap 
between the adopted modulation and coding scheme and the 
one achieving channel capacity, the prescribed positive weight 
for receiver fc's rate, the sum power available to transmitter fc, 
and the power spectrum mask imposed on pkn, respectively. 

III. Revisit of the SCALE algorithm 

It is difficult to find a global optimum for ([T]i since .g(P) is 
neither convex nor concave of P. Initialized by a feasible P(^\ 
the SCALE algorithm circumvents this difficulty, by solving 
convex approximations of ([TJ successively. To facilitate de- 
scription, a superscript m put to a variable indicates that it 
is associated with the mth iteration of the algorithm hereafter 
{m > 1). Specifically, p(™+i) is found as a global optimum 
to an approximation of (HJ, i.e.. 



max 5(™)(P) =5(P(")) +E 



k.n ^ ikn 



log 



7fcn(PrO 



N 



Pkn G[0 Vfc,n, (2) 

where g(™'(P) is a lower bound approximation of g{P) with 
tightness at P P(™), i.e., V P, g{P) > g'^"'\P) and 
(^(p(™)) = (^(™)(p(™)) as proven later Based on this prop- 
erty, ^(P^^+i)) > g("')(p(™+i)) > 5(™)(p(™)) = g(p(™)) 
follows, ensuring that {5(P("))|V m} IS an increasing se- 
quence. Therefore, f/(P^™^) must converge as m increases. 

Note that g^™\P) is not concave of P. Nevertheless, 
after replacing P with P = e*^ where Q contains log- 
power variables ([Qjfcn = Qkn), g^^^\e^) is concave of Q, 
because log(7fe„(q„)) is concave of 

according to Lemma 1 of [J] . This means that after the change 
of variables, ^ can be solved by state-of-the-art convex 



optimization methods. The idea behind the SCALE algorithm 
is illustrated in Figure [T] 




,,(...) (,Q, 



(a) 



(b) 



Fig. 1. Illustration of the idea behind the SCALE algorithm, where Cp is 
the feasible set of (T). 

In 13], ||6l, some theoretical analysis has been made to show 
that when ^(Pt^+i)) = g(P<'")), P^™' satisfies the Karush- 
Kuhn-Tucker (KKT) conditions for ([T]i. In the next section, 
we will present geometric interpretation and more properties 
of the SCALE algorithm. To this end, we first revisit the 
derivation of g^™'' (P) as a lower bound approximation of g{P) 
with tightness at P = p(™) jgl. To facilitate description, 
the log-SINR variable for carrier n of receiver k is defined 
as (pkn^ ^nd all (pkn, V k,n are stacked into the matrix $ 
with [^]kn = (j^kn- The $ corresponding to Q is denoted by 
$(Q). [$(Q)]fen is a function of q„, and denoted by (f>kn{<ln) 
hereafter. Note that (j)kn{thi) = log(7fcri(qri)) a concave 
function of q„ as said earlier. The feasible sets of Q and $ 
are defined as Cq and C$ = {$(Q)|Q G Cq}, respectively. 
It is very important to note that an analytical expression for 
Ccj) was given in ifTTl . with which it can be proven that C$ 
is an unbounded convex set by using the log-convexity of the 
spectral radius function of a nonnegative matrix. 

The WSR when expressed as /($) = J2k n log(l + 
^^^^), is strictly convex of $. Thanks to this property, the first- 



order Taylor approximation of /($) around 
expressed as 



$(Q('»)), 



/(")($) =/(<i>M)+^ 



Dkn 



k,7i 



')kn 



(3) 



where 0^^^ = [$(")]fc„, is a lower bound approximation 
of /($) with tightness at $ = $('"), i.e., V /($) > 
/('")($) and = /(™)($(™)). Note that .g'^^P) = 

/("')($(P)) where $(P) denotes the $ corresponding to P, 
indicating that ^^'"^(P) is indeed a lower bound of g{P) with 
tightness at P = P^"), 



IV. Analysis of the SCALE algorithm 
A. Geometric interpretation of the algorithm over 

The derivation of g'™' (P) inspires fundamental insight that, 
the SCALE algorithm actually exploits the convexity of the 
WSR with respect to $ and the entry wise concavity of $(Q) 
with respect to Q, to iteratively look for increasingly better 
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$ e C$. At the mth iteration, the first-order Taylor approx- 
imation of /($) around is used to construct /''"■'($) 
as a lower bound approximation of /($), by exploiting the 
convexity of /($) with respect to $. Then, a <!> maximizing 
over C$ is found by solving (O and assigned back to 
so as to ensure that > > 
/('")($(™)) = /($(™)). Note that this idea of formulating the 
WSR as a function of $, and then iteratively maximizing its 
first-order Taylor approximation has also been used in ifTTI - 

m. 




Fig. 2. Illustration of looking for <!>('"+ ^) over C$. 

When viewed over C$, the SCALE algorithm can be inter- 
preted in a very simple way. To this end, note that the mth 
iteration is to find 



$(m+i) ^ y- 



k.n 



»feri 



(4) 



i.e., is the 4> G C$ with the maximum projection of 

(() _ $('") toward the direction specified by V$/(<I'(™)) as 
illustrated in Figure |2] It can readily be shown that $('"+1' 
must belong to C$'s Pareto-optimal boundary (POB) denoted 
by C|, due to the fact that V £ C*, V$/($(")) ^ 0. 

Specifically, consists of all $ 6 for which there 
does not exist a $ G C$ satisfying $ >- $ ||T4!|. Every 
point belonging to C§ represents a best tradeoff among (j>kn, 
\/ k,n for maximizing the sum of them weighted by certain 
coefficients. Moreover, the iterative search has the following 
property: 

Lemma 1: If ^ $(™), then > /($<")). 

Proof: From the strict convexity of /($) with respect 
to $, /^™^($) is a strict underestimator of /($) except 
at $ = Therefore, > /(™)($(™+i)) > 

— /($(™)). ■ 

In a word, the SCALE algorithm iteratively searches over 
to produce with strictly increasing WSR as m 

increase^ 

' As proposed in (6j, the SCALE algorithm can be generalized to maximize 
the WSR of rate-adaptive (RA) users penalized by the sum power consumption 
of fixed-margin (FM) users. In such a case, it can be shown that, the 
SCALE algorithm can be interpreted as iteratively looking for <I> which 
maximizes the weighted sum of all corresponding to the RA users 

penalized by the sum power consumption of the FM users, over a convex 
subset of Ccj. dependent on <!>('"). 



B. Properties of the algorithm at convergence 

Note that a fixed point denoted by $' for the iterative 
operation (HI of the SCALE algorithm must satisfy 



$ = arg max > 0fc„ 



(5) 



Let us collect all fixed points for the SCALE algorithm in 
C|. It can readily be shown that C| must be part of the POB 
of C$, i.e., C| C C^. Moreover, every $' e C| is a stationary 
point of /(<&) over C$, because 



k.n 



'kn 



(f>kn) > 



(6) 



is satisfied (see the definition of a stationary point in page 194 
of ifTSl ). Since is convex, all local maximum of /($) over 
C$ must belong to C| according to Proposition 2.1.2 of ifTSll . 
Moreover, the following theorem can be proven: 
Theorem 1: The following claims are true: 

1) If /($('"+!)) = then $('"+!) = $('") G C| 
holds and p(™+i) ^ p(™) satisfies the KKT conditions 
of O. 

2) Provided that 



> max /($) 



(7) 



where denotes the set of globally optimum <& over 
C$, then lim„j^_|-oo /(^'■™-') = 9* where g* is the 
maximum WSR for ([T]). 

Proof: To prove the first claim, suppose ^ (()('"). 

From Lemma [fl /($("+!)) > /($('")) follows, leading to a 
contradiction. Thus, = $(™) e C|. Since $(P) is a 

one-to-one mapping as shown in ifTTll . lfT2l . p(™+i) is equal 
to p('"), and they satisfy the KKT conditions of ([T]l as proved 
in 13. 

To prove the second claim, two cases are examined. In 
the first case, an m satisfying /(<1'^™') = exists. 
Suppose ^ CI, then e C| - follows, meaning 
that > This is a contradiction with the fact 

that /(<I'^™^) is strictly increasing with m, thus e Q 



and lim 



(m)>l 



= g* holds for the first case. In 
the second case, 

/($(™)) strictly increases endlessly as il- 
lustrated by a numerical example in Section VI-A. According 
to the monotone convergence theorem, ./(<&^™-') approaches 
the supremum of {/($)|$ S C|}, i.e., lim,„_^+oo /($^'"^) = 
sup^GC^ /($) = g* lfT6l . Therefore, the second claim is true. 

■ 

The first claim indicates that when ^ /($('")), 

the SCALE algorithm reaches a fixed point in C|, and p(™) 
(respectively, $(™)) will remain there for all following itera- 
tions, i.e., it never happens that /(P'™)) remains fixed while 
p(m) jjggpg oscillating among different values as m increases. 
Nevertheless, it may happen that /(P^™)) increases strictly 
and endlessly as illustrated by a numerical example in Section 
VI-A. Therefore, the SCALE algorithm can be terminated 
when either prescribed M iterations are already executed or 
■p{m+i) ^ p(m) satisfied. 



4 
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According to the second claim of Theorem [T] the SCALE 
algorithm is asymptotically optimum, i.e., it produces p('") 
with WSR approaching g* as long as /(P*^^') is above the 
threshold value on the right-hand side of (|7]i, even though 
V m, p(™) is entry wise positive while the optimum P for 
([T]i might contain zero entrie^. This will be illustrated by 
a numerical example in Section VI-A. Note that the above 
condition is generally true to guarantee global optimality. It 
was also pointed out in ifTTI - lfTSll that for the single carrier 
case, the SCALE algorithm always converges to a global 
optimum when the optimum <I> and {wfc|V k} satisfy some 
special conditions. 

V. Approaches to implement the SCALE algorithm 

The key to implementing the SCALE algorithm is to solve 
the ?Tith iteration problem: 

max ^ 4"' (gffc„ -log (crL +51'^'='"^*")) 

N 

s.t. ^e"="<p^,Vfc; e^''" <Pfc„,Vfc,n, 



where a 



(m) _ tm.7l" 



WkGk 



kn 



r+7 



Gfcfc„ii^+r-/fc„(p(")) 



In the follow- 



ing, we first present the low-complexity approach proposed 
in Is), and then the GP approach to implement the SCALE 
algorithm. Finally, the two approaches are compared. 

A. The low-complexity approach 

Algorithm 1 The low-complexity implementation approach 

m = 0; V fc, n, = Wk, Pkn = 0; 
repeat 
repeat 

V k, fjLk = if rjfe„(P, 0) < Pf., otherwise search 
the bk > satisfying J2n f^fen(P) ^fe) = Pk with the 
bisection method, and pk = bk', 

V k, n, update pkn with r2fe„(P, pk); 
until P converges or L iterations have been executed 
TO = TO + 1; p(™) = P; compute al^\ V fc, n; 

until TO = M or P(") = p(™-i) 
output P'" as a solution to ([T]i. 

A low-complexity approach was proposed in jSj to solve 
with KKT conditions based fixed-point equations and the 
bisection method for updating P and Lagrange multipliers, 
respectively. To facilitate description, the Lagrange multiplier 
for transmitter fc's sum power constraint is denoted as pk- The 
KKT conditions require the P corresponding to the optimum 
Q and the optimum Hk, V fc to satisfy: 



V fc, n, Pkn = ^kn(P, fJ-k), and V fc, pk 



Pkn - Pk 



= 0, 



^ This happens under certain conditions shown in 1171 . However, it still 
remains an open and challenging problem to decide which users and carriers 
should be shut down. 



where f]fe„(P,^fe) 



Mfc+E,^tK*r'G,A,„//,„(p)) 



and 



[x]y = max{y, min{a;, z}}. The low-complexity approach is 
summarized in Algorithm [T] 

B. The GP approach 

In fact, it can be shown that (O is equivalent to 



mill loff I e 
Q 



-qk 



S.t. log 
log 



J_g9k„) < o,Vfc, 

Pk 

[Gkine"'"-''^ 



l^k 

logiJ-e'}"") < 0,Vfc,n. 

Pkn 



'kn'' 



^) <0,Vfc,n, (9) 



where {tfc„|V fc, n} is a set of extra variables introduced to 
guarantee equivalence. The GP approach to implement the 
SCALE algorithm simply uses a GP solver, to solve (|9]l for 
Qm+i |-ggg Appendix). 

A subtlety deserving special attention is that, an extra lower- 
bound constraint on every qkn, expressed as q^n > log(0 
where ^ can be preassigned, is added by default in the GP 
solver to avoid overflow|j. Denote the feasible set of $ by 
C$(<^) after the extra constraints pkn > k,n sae added. 
It can be seen that after adding the extra constraints, the GP 
approach iteratively searches over {£,) representing the POB 
of C$(^) to produce p('") with strictly increasing WSR. 

The GP approach actually implements the SCALE algo- 
rithm to solve ([T]i with the extra constraints. It is interesting to 
evaluate the incurred loss of the maximum WSR for ([T]i due to 
the extra constraints. Obviously, g*{£,) < g* where g*{£,) is the 
global optimum for ([T) with the extra constraints. Note that ^ 
might have multiple globally optimum solutions, one of which 
is denoted by P* hereafter. If there exists a P"^ satisfying 
P* >z 9* follows. Otherwise, g*{£,) < g*, meaning 

that a loss of the optimum WSR is incurred by the extra lower 
bounds. In practice, it is rarely known a priori if there always 
exists a P* satisfying P* >^ ^. To evaluate the worst-case loss 
of the optimum WSR, we present the following theorem: 



Theorem 2: Suppose < ^ <C 



Pk 



N{N+1) 



g*>g*iO>9*~mK^¥'a^ 



then 

wiG^ 



kd.n 



(10) 



'In 



Proof: To prove the claim, two cases are examined. In 
the first case, there exists a P* >^ ^, thus ( fTOb follows since 
g*(^) ~ g*. In the second case, there exists at least an entry 
smaller than ^ for any P"^, and we will prove the validity of 
( fTOb as follows. 

Let's first choose a P*, from which we will construct a P" 
very close to it and feasible for ([T]i with the extra constraints 
in two steps. In the first step, all entries in P* smaller than ^ 
are raised to be ^. The resulting power allocation is denoted 
by P'. Clearly, the total increase of every transmitter's sum 

^ As for tfe„,V k,n, the idle constraint tj.„ > log((T|^), i.e., it is 
always relaxed since the second constraint in (9) should be satisfied, can 
be preassigned to the GP solver. 
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power is not higher than N£^. In the second step, all rows 
of P' are examined. For the fcth row, no entries is updated 
if J2n [P']fen — Pk- Otherwise, only the maximum entry, 
i.e., [P']fe„^ with rifc ~ argmax„[P']fe„, which must satisfy 
[P'lferi^ > 7^ > + is reduced by N^,. The finally 
produced power can be taken as P" 



It can be shown that \/ k,n. 



dgjP*) _ dC„{Pn 
dpk 



dD„{P*} 



where C„(P) = wi log(G/,„p/; + r/,„(P^)"and dJp) = 
J2i log(r/i„(P)). Therefore, 



dpkn Pkn 



< K max 



E WlGlkn 

l^k '"^ 
WlGlkn 



> — A^max 



Ikn 



U.n af^^ 



Obviously, g* > g*{£,) > g{P")- Since ^ is very small, 
g{P") can be computed according to the first-order Taylor 
approximation around P* as 



g{P")^g*+J2 (- 

keSk \ 



- dfjP*) 

dp. 



kuk 



y , df{P*) 



where S is equal to either or iV^, k belongs to Sk if the 
entries in P* corresponding to transmitter k are modified to 
get P", and Skn is the set of carrier numbers n satisfying 
[P*]kn < Therefore, 

WlGlkn ^T^TT^ WlGlkn\ 

-oK max 2 s-'^^ A max = — I 

WlGlkn 



5(P")>5* _ ^ 

>g*- 2^NK^ max 



E 

keSk 



k,l,n (J 



In 



meaning that the claim holds for the second case as well. ■ 
According to the above theorem, ^ should satisfy ^ < 

e (2NK^ maxfc^;,„ ""^'"" ^ to ensure that g*{^) > g* - e, 
where e > is prescribed and represent the maximum 
tolerable loss of the optimum WSR. Algorithmic summarizes 
the GP approach to implement the SCALE algorithm, where 
a ^ satisfying the above condition is used to configure the GP 
solver. 

Algorithm 2 The GP implementation approach 



m — 0;\/ k, n, 



Wk, 



repeat 

solve ^ for Q(™+i) by a GP solver configured with a 

i < e (2NK^ maxfc,z,„ 

p(m+i) _ gQ'"+^'. TO = 777,^1; compute a[™', V k,n; 
until m = M or P("') = p(™-i) 
output p(™) as a solution to ([Til. 



when C| and g* are replaced by C|(^) and g*{£,), respectively. 
This means that as long as a sufficiently good initialization 
is used, the GP approach still produces p('") with WSR 
approaching g*{£,) asymptotically. 

The low-complexity approach uses a heuristic rule to pursue 
the combination satisfying the KKT conditions of Once 
converged, the inner iterations must output a global optimum 
for ^ as p('"+i). If the convergence of inner iterations 
could always be achieved, the low-complexity approach would 
become a faithful implementation of the SCALE algorithm. 
However, it is unclear if the inner iterations always converge 
as L increases, since a theoretical proof is difficult and has not 
been available yet. Nevertheless, numerical experiments show 
that the convergence is always observed when L is sufficiently 
large for practical channel realizations lH. In practice, it is 
very attractive to implement the SCALE algorithm with the 
low-complexity approach using a very small L. In such a 
case, the inner iterations might often not converge, and the 
low-complexity approach might have very interesting behavior, 
e.g., the produced y(p('"+i)) might be either smaller, or even 
greater than the WSR of the optimum for (|8]l, as will be 
illustrated later 

Note that the GP approach with a very small ^ can be re- 
garded as a close approximation of the faithful implementation 
of the SCALE algorithm. Therefore, the GP approach can be 
used as a benchmark to evaluate the low-complexity approach. 
In Section VI, we will further use numerical experiments to 
compare the GP approach and the low-complexity approach, 
and show the behavior of the low-complexity approach when 
L varies. 

VI. Numerical experiments 
A. A simple numerical example to illustrate analysis 

We first consider a simple scenario with K = 2 and = 1 
because a faithful implementation of the SCALE algorithm can 
readily be made in this scenario. The parameters are set as F = 
dB, V /c, n, p;, = pj.„ = Gkkn = = 1- Since a single 
carrier exists, the carrier-number subscript of every variable is 
omitted for simplicity hereafter. The analytical expression C$ 
and C$(^) can readily be derived and thus not shown here due 
to space limitation. 

Figure [3] shows the results when the first set of channel 
and weight parameters in Table |T] is used. For the faithful 
and low-complexity approaches illustrated in Figure [3]a, 
is plot as the line asymptotically extending to (— cx), 0) and 
(0, —oo), respectively, and C$ is the unbounded region below 
that line. For the GP approach, ^ = 0.05 is chosen, and the 
POB and the lower-boundary for C$(^) are plot, with C$(^) 
being the region enclosed inside. Obviously, C$(^) C C$ and 
C C^, respectively. C$ is convex, which illustrates 



C. Comparison of the two approaches 

As said earlier, the GP approach iteratively searches over 
to produce p(™) with strictly increasing WSRs, i.e.. 
Lemma [T] still holds. Therefore, the GP approach has guar- 
anteed convergence as m increases. Note that C$(^) might 
be nonconvex as illustrated later. Theorem [T] also remains true 



TABLE I 

Three sets of channel and weight parameters. 





911 


922 


912 


921 


W\ 


W2 


the first set 


LO 


LO 


0.8 


0.4 


3.0 


LO 


the second set 


LO 


LO 


L8 


0.4 


L8 


LO 
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The faithful implementation The GP approach 




-3 -2 -1 -3 -2 -1 

The low-complexity approach with L=1 The low-complexity approach with L=8 




-3 -2 -1 -3 -2 -1 
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Fig. 3. Visualization of the iterative searches for the approaches to implement 
the SCALE algorithm, when the second set of channel and weight parameters 
is used. 

the validity of the analysis in ifTTI - lfTSl . Most interestingly, 
C$(^) is nonconvex for this parameter set. The globally 
optimum $ in and C$(^) is at {(f)i —0,4)2 ~ —go) and 
= —0.04, (f>2 = —3.33), and corresponds to g* = 2.08 
and (?*(^) = 2.06, respectively. Obviously, they satisfy ( fTOl i. 
thus illustrating the validity of Theorem |2] The faithful im- 
plementation and the low-complexity approach with L = 8 
produce the same sequence of $™ approaching the globally 
optimum $ asymptotically as m increases (the iterations 
proceed endlessly, and only the first 7 iterations are shown 
here for clarity). This illustrates the asymptotic optimality of 
the SCALE algorithm even when the optimum P contains zero 
entries, as discussed in Section IV. This also suggests that the 
inner iterations for the low-complexity approach with L = 8 
always converge to an optimum solution for dHJ, V m. The GP 
approach converges to the globally optimum $ in C$ (^) after 
7 iterations. On the other hand, the low-complexity approach 
with L = 1 produces a different sequence of solutions, 
suggesting its inner iterations do not always converge to the 
global optimum for Nevertheless, the produced still 
approaches the globally optimum $ asymptotically. 

For the second parameter set in Table H] the results are 
shown in Figure |4] For this parameter set, the globally op- 
timum $ in C$ and C$(f) is at {(pi =0,^2 = — oo) and 
{(pi = —0.12, (p2 = —3.32), and corresponds to g* = 1.25 and 
g*{^) = 1.17, respectively. They still satisfy ( fTOl i and illustrate 
the validity of Theorem|2] Similar phenomena can be observed 
as for the first parameter set, except for those explained 
as foUows. The faithful, GP and low-complexity approaches 
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Fig. 4. Visualization of the iterative searches for the approaches to implement 
the SCALE algorithm, when the third set of channel and weight parameters 
is used. 

with i = 8 all produce the same sequence of and 
converge to a fixed point {(j>i = —1.03, (f)2 = —0.34) which is 
locally optimum after 3 iterations. In particular, the /(<i>'^') 
produced by all the approaches does not satisfy condition (|7]i, 
indicating such a convergence to a local optimum is indeed 
possible. Moreover, it is very interesting to see that for the 
low-complexity approach with L = 1, the produced ^'-^^ has 
a higher WSR than that for the other approaches, and the 
approaches the globally optimum ^ asymptotically as 
m increases (only the first 4 iterations are shown here). This 
suggests that the low-complexity approach using a small L, 
even without convergence of its inner iterations, might lead to 
a better solution than the GP approach. 

B. Numerical experiments for a realistic scenario 

We have also conducted numerical experiments for a real- 
istic scenario with K — A and N — 128. The fcth receiver 
is located at the coordinate {x — k,y — 10), whereas the 
A:th transmitter is at {x = k,y = 5) and {x ~ k,y ~ 0) 
if k = 1,2 and k — 3,4, respectively. These coordinates 
are in the unit of meter. The parameters are set as F = 
dB, wi = W2 = 1, W3 = u'4 = 2, V fc,n, (7^,^ = —30 
dBm, = pi^. The channel for every link is generated 
with the channel model explained in ifTSl . |fT9l . Note that 
the transmitted power is attenuated by 30 dB in average 
when received at a distance of 10 meter apart. To evaluate 
the effectiveness of the implementation approaches for the 
SCALE algorithm, the ISB algorithm proposed in ||2l was also 
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Fig. 5. The average WSRs produced by every implementation approacii for 
the SCALE algorithm, and for using the UPA and ISB algorithm, respectively. 

implemented (note that a grid search of 100 points was used to 
optimize every power variable to ensure a good performance). 
We conducted the following experiments with Matlab v7.1 on 
a laptop equipped with an Intel Duo CPU of 2.53 GHz and a 
memory of 3 GBytes. 

We have generated 100 random realizations of all channels. 
When V fc, pj. = 50 dBm, we have implemented for every 
realization the ISB algorithm, the GP and low-complexity 
approaches with different L. The GP solver gpcvx was used 
to solve dU for the GP approach, and ^ = 10^^" is assigned 
II20I . During the simulation, we find that for every channel 
realization, the maximum gikn, V k,l,n is smaller than 10^'^. 
According to Theorem H g*{^) > - 8.19 x lO"'^ follows, 
meaning that the worst-case loss of the optimum WSR is 
negligible. 

The average time spent by the ISB algorithm is around 
600 seconds. The average time for the GP approach increases 
proportionally with M and it is around 100 seconds when 
M = 8, and the one for the low-complexity approach increases 
proportionally with L * M, and it is around 1.5 seconds when 
M = 8 and L = 16. The GP approach is much faster than the 
ISB algorithm, because the interior-point method (IPM) used 
is more efficient than the dual method and exhaustive search 
used by the ISB algorithm. The low-complexity approach is 
much faster than the GP approach, indicating that the heuristic 
update rule used for the inner iteration of the low-complexity 
approach leads to a much faster speed than the IPM used by 
the GP solver 

Figure |5] shows the average WSR of the solutions produced 
by every implementation approach for the SCALE algorithm 
when M < 8. The average WSRs corresponding to using 
the uniform power allocation (UPA) and the ISB algorithm, 
respectively, are also shown. It can be seen that the SCALE 
algorithm implemented with every approach leads to a much 
greater average WSR than the UPA when M > 2. The average 
WSR for the GP approach increases and becomes close to that 
for the ISB approach as M increases, which illustrates the 
effectiveness of the SCALE algorithm. For the low-complexity 
approach with a fixed L, the average WSR increases as M 
increases. For the low-complexity approach with a fixed M, 
the average WSR increases as L increases, and is close to 
that for the GP approach when i > 4. Most interestingly, the 
low-complexity approach using L = 1 and 71/ = 8 is a good 



option to implement the SCALE algorithm in practice, since 
it has a fast speed and its average WSR performance is close 
to that for the GP approach. 

VII. Conclusion and future work 

We have presented the geometric interpretation and prop- 
erties of the SCALE algorithm. A GP approach has also 
been developed to implement the algorithm, and an analytical 
method to set up the lower-bound constraints added by a GP 
solver Numerical experiments have been shown to illustrate 
the analysis and compare the GP approach and the low- 
complexity approach proposed previously. 

In future, the following aspects can be further investigated. 
First, a theoretical study can be made on the convergence 
of the inner iterations for the low-complexity approach, as 
well as the behavior of the low-complexity approach using a 
small inner iteration number. Second, we can study how to 
generalize the SCALE algorithm for multiple-input-multiple- 
output systems, and compare it with other algorithms 1211 . 
II22I . Third, it is important to note that the SCALE algorithm 
iteratively produces increasingly better $ through solving (|2]i 
for the globally optimum Q e Cq and then transforming it 
back to Another possible way is to solve dU directly for 
the globally optimum $ e C$, e.g., by using an analytical 
formulation of C*. Works along this direction have been done 
in ifTTl - lfTSl for the single-carrier case. It is interesting to 
check how to extend those studies for the multicarrier case. 
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Appendix 

A standard-form GP problem is expressed as 
min /o(x) 

X 

s.t. /,(x)<l,j = l,...,J, (11) 
Xi>0,i = I,..., I, 

where V j = 0, 1, • • • , /j(x) is a posynomial of x with [x]i = 
Xi. Specifically, an example posynomial of x is expressed as 
= J2a=i '"a(^) with ita(x) = JlLi ^7" ^here Pa > 
and aai (« = 1, • • ■ ,1) are real constants. It is very important 
to note that s(y) = log (p(e^)) is convex of y. Therefore, 
although a standard-form GP problem in (fTTT i is nonconvex, 
it can be converted by making the logarithmic transformation 
from X to y satisfying x = to its equivalent convex form 

min .9o(y) log (/o(e*')) (12) 
y 

s.t. g,{y) - log {f,{e^)) < 0,j = 1, J, 

which is then solved by a GP solver, e.g., gpcvx or MOSEK 
based on state-of-the-art IPM. 

An important subtlety should be noted. When all globally 
optimum x for (fTTT i contains zero entries, the corresponding 
optimum y for (fTZt contains entries equal to — oo. In such a 
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case, the IPM iteratively outputs y with go{y) asymptotically 
approaching the optimum objective value for ( fT2] i. which leads 
to an overflow in the processor running the GP solver that 
implements the IPM. It is rarely known a priori if the optimum 
X for ( fTTT i contains zero entries. To avoid overflow, the GP 
solver by default adds entrywise lower-bound constraints on x, 
or equivalently on y before solving fT% (see page 3 of 1201 ). 
These constraints should be set to ensure that the optimum x 
for ( fTTT ) with the extra constraints corresponds to an objective 
value within a prescribed small tolerance around the original 
optimum objective for ( fTTI ). 
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